Title: Strategy for modeling a Pavement Performance Analysis System at WisDOT
نویسندگان
چکیده
One of the most difficult tasks in pavement management information systems is establishing the links between performance measures of a structure and the design and construction inputs. During the last two decades Wisconsin Department of Transportation has made significant steps in improving material properties, design practices, and monitoring of pavement performance. The databases that are available at the DOT include detailed material properties, detailed pavement structure information, and detailed pavement indicators and performance measures. In-situ pavement performance can be considered a response variable to many project input variables, such as design, construction, and traffic loading effects. If we are to fully understand the component of pavement performance and specify the inputs through design and construction specifications to achieve that performance we must develop quantitative relationship between input and response variables through a scientific, fully integrated Pavement Performance Analysis System (PPAS). A comprehensive and fully integrated data acquisition, modeling, and analysis system is necessary to quantify the relationship between design and construction inputs, and the resulting in-situ pavement performance output. Hence, the objective of this study is to design a database model required for developing an effective database template that will allow analysis of pavement performance measures based on design and construction information linked by location. In order to select the most appropriate database model, a conceptual database model (Entity Relationship Model) and dimensional model, which is believed to be the most effective modeling technique for data warehouse project, are designed and compared. TRB 2004 Annual Meeting CD-ROM Paper revised from original submittal. Jae-ho Choi and Teresa M. Adams 2 INTRODUCTION One of the most difficult tasks in pavement information systems is establishing the links between performance measures of a structure and the design and construction inputs. During the last two decades, Wisconsin Department of Transportation has made significant progress in improving material properties, design practices, and monitoring of pavement performance. The databases that are available at the DOT include detailed material properties, detailed pavement structure information, and detailed pavement indicators and performance measures. One of the most significant achievements of State Highway Research Program (SHRP) was developing new testing systems that relate mechanistic properties of materials and mixture design methods to in-situ pavement performance. Mechanistic properties are defined through new methods and standardized system for incorporating material, environmental, and other engineering properties. The results of SHRP could be utilized effectively if a Pavement Performance Analysis System (PPAS) is developed. In-situ pavement performance can be considered as the response variable given project inputs, such as design, construction, and environmental and traffic loading. The existing processes used by WisDOT and other highway agencies to determine these relationships have been largely based on a collection of experiences and knowledge acquired through years of pavement performance monitoring and continuous specification development. In order to fully understand the component of pavement performance and specify the inputs through specifications to achieve that performance we must develop quantitative relationship between input variables and response variables through the scientific, fully integrated PPAS. Portions of the system are already in-place, but research is necessary to develop the database model for a PPAS which will allow analysis of pavement performance measures based on design and construction information linked by location. This study divides the research attempts regarding developing a PPAS into two major categories: the conceptual and the logical field. In the conceptual field, the major issue is to design conceptual modeling formalisms (e.g., Entity Relationship Model) using the network overlay techniques. In the logical field, the research focuses on the development of a logical, implementation independence model (e.g., Dimensional Model) to describe multidimensional data structures, in which the data can be seen as depending on several independent variables. Finally, the comparison between these two models is illustrated to recommend a more appropriate database model for implementing PPAS database template. OVERVIEW MODEL OF A PAVEMENT PERFORMANCE ANALYSIS SYSTEM An overview of the existing databases and a vision for how to link them to achieve the objectives of this study were established after discussions with the DOT engineers. Figure 1 is a schematic representation of how different sources of data located in different systems can be linked to develop a fully integrated PPAS. The upper part of Figure 1 indicates the application systems used by WisDOT. The source files include pavement structure data, design JMF (Job Mixture Formula) data, material QC/QA testing data, performance measures, traffic, and aggregate source testing data. The pavement structure data, JMF, and material QC/QA data are grouped because these sources are subjected to the modification based on the feedback from PPAS. Nyerges (1) explains that most transportation information can be referenced using a combination of three schemes: road name and milepoint, control section, and chain (link) and node. Based on this classification, Layer and Base (L&B) database and Pavement Information Files (PIFs) use a combination of road name and reference point (pseudo milepoint) scheme, and control section scheme, respectively. Table 1 shows the summary of location reference schemes of databases, which is vital information in integrating the extracted datasets through location referencing. FROM_RP and TO_RP are expressed in reference points whereas FROM_PLUS and TO_PLUS are expressed in miles. If the location of FROM_RP of a project is known in miles through the conversion process from reference point to milepoint, summing up the location of FROM_RP and FROM_PLUS will identify the location of the starting point of a project. DESIGNING A CONCEPTUAL DATABASE MODEL This section presents a conceptual database design for PPAS that allows extracted datasets from the existing databases to be related in order to facilitate the pavement performance modeling with respect to material and construction variables. The design of the extracted database should be driven by a clear statement of the ways the database is to be used in the proposed modeling framework. Without a clear statement of the intended use it is not TRB 2004 Annual Meeting CD-ROM Paper revised from original submittal. Jae-ho Choi and Teresa M. Adams 3 possible to evaluate the usefulness and correctness of the proposed database design. Therefore, five important objectives of the PPAS were established based on the literature search and discussions with the DOT engineers. The important queries for PPAS can be surrogates for illustrating functional requirements of the PPAS. They are as follows: 1. Determine the effects of specific design features on pavement performance 2. Determine the effects of specific construction features on pavement performance 3. Determine the relationship between material properties and distress types 4. Determine the effectiveness of pavement maintenance treatment 5. Develop mechanisms to re-calibrate coefficients of the PPAS model based on comparisons between predicted and actual measured values In addition to these selected objectives, the relationship between as-constructed material and construction input variables, and pavement performance is one of the main challenges involved in estimating future pavement performance. To establish sampling and testing components of variance of quality characteristics for specification improvement, PPAS should be able to correlate quality characteristics with distress over time. Such a system is difficult to achieve using the existing WisDOT information systems since the current QC/QA split sample location is based on the tonnage at the point of sampling (quantity) not on highway location references (2). This requires a plan for including location references in the current quality management section database. Figure 2 shows an example illustrating how the proposed PPAS can facilitate the analysis for identifying the relationship between as-constructed QC/QA data and field performance. If the design target for asphalt content is 5.0%, and there are significant deviations from the target in several pavement sections where raveling occurred, the regression or Analysis of Variance (ANOVA) models would detect some relationship. It may be learned that certain deviations from target are perfectly acceptable, while other deviations are critical to performance. Since the traffic levels, temperature, and base condition are consistent, these variables could be blocked form the analysis. Then those variables that changed within the project could be evaluated (3). Developing a Strategy for relating the Component Datasets Figure 3 shows a high-level schematic of a strategy for linking the separate databases. “A”, “B”, and “C” in figure 3 indicates components currently lacking in the existing pavement information systems used by WisDOT. “A” indicates that material properties cannot be related to performance measures. “B” indicates that QC/QA data cannot be related to distress data. “C” indicates that field distress data are not available based on sampling location. This is because current method for sampling is mostly based on the quantity of material placed. In pavement information systems used by WisDOT, the location schemes of L&B, PIF, and traffic databases are linear-oriented (the metric for reference is along the length of a highway) (1), whereas the MTS database does not use any location referencing schemes. As shown in figure 3, L&B, PIFs, and traffic databases can be related each other using network overlay techniques. The L&B database plays a central role to link material data (Job Mixture Formula) to performance measures because this database includes a common attribute (Project Identification Number) based on which material data and performance measures can be related. Network Overlay Process to relate Material Data, Traffic Data, and Performance Measures In general, different application groups in an organization have difference control sections, one for pavement construction management (e.g., L&B database) and another for highway performance monitoring (e.g., PIFs). Each section is homogeneous with respect to the attribute of interest to that application group. Therefore, when data is to be integrated form different records (e.g., L&B and PIFs) then network overlay techniques using ordered milepoints can derive the common attribute values needed as was indicated by Nyerges (1990). Since L&B database and PIFs use a reference point method to reference the control section, conversion process from reference point to milepoint (mileages) is necessary to be used in the network overlay process. Figure 4 illustrates the network overlay process used to relate material and construction data in L&B database and performance measures in PIFs given the assumption that reference scheme was converted to milepoints scheme (e.g., A, B, C, and so forth shown in figure 3). Specifically, Co_LAYER entity is the product of network overlay algorithm between “Layer” entity in the L&B database and “Section Description” entity in PIFs. TRB 2004 Annual Meeting CD-ROM Paper revised from original submittal. Jae-ho Choi and Teresa M. Adams 4 This network overlay process divides the network into smaller sections (e.g., 1, 2, 3, and so forth as shown in figure 4) where the attributes data are homogeneous. For example, section 1 is associated with PROJ_ID 1234 and PIFs 3722. This section can be associated with material and construction data through PROJ_ID 1234 and with performance measure through sequence number (SEQ) 3722. In order to relate traffic data to a corresponding record in Co_LAYER entity, traffic database needs to be overlaid on the Co_LAYER entity using network overlay technique. It requires converting reference points used in traffic database to milepoints before network overlay is conducted since traffic database uses reference point method (FROM_RP and TO_RP) to indicate the range of a traffic control section. This overlay process will add a new attribute (SITE_ID) which is the foreign key to traffic volume entity to a corresponding record in Co_LAYER entity. This will produce a new entity named Co_LAYER_II whose attributes, therefore, include PROJ_ID, SEQ, SITE_ID, FROM_RP, and TO_RP. Entity-Relationship (E-R) Model for PPAS Based on the E-R representation of each database and files (4), a conceptual E-R model for developing the PPAS was developed as shown in figure 5. The entity “Co_Layer_II” is the product resulting from the network overlay processes of “Layer (Layer shown in figure 4)” entity in the L&B database and “Section Description (SECTION shown in figure 5)” entity in PIFs and “SITE” entity in traffic database. There are several entities that can be combined into one entity in figure 5. For example, SECTION, LAYER and Co_LAYER_II entities can be combined into one big denormalized table since minimizing Structural Query Language (SQL) joins is a good way to improve performance execution time. As was indicated before, material data including design JMF, QC/QA, and aggregate data can be associated with the pavement structural data in the L&B database through a project identification number (PROJ_ID). Figure 5 shows that they could have the same entity table named “TEST_NO” indicating that material data entities such as QC/QA, JMF, AGGREGATE entities can be related to TEST_NO entity through the attribute TEST_NO. DIMENSIONAL MODELS FOR DATA WAREHOUSE DESIGN It is required to select an appropriate database model before implementing database template for PPAS. In this section, the dimensional model, which is an alternative to conventional E-R model, was developed since the dimensional model has a number of important data warehouse advantages that the E-R model lacks. Dimensional modeling (DM) is the name of a logical design technique often used for data warehouse that seeks to present the data in a standard framework that is intuitive and allows for high-performance access (5). A data warehousing is an architecture for integrating data resources to be used for reporting and decision support initiatives (6). The data within the data warehouse is accumulated in multidimensional data structures to support direct querying and multidimensional analysis in which the output data can be seen as depending on several independent variables. It is known that E-R model is normalized to remove all redundancy and structure the inter-relationship between data. Its structure is more efficient in transaction-oriented business. A dimensional model contains the same information as an E-R model but packages the data in a symmetric format whose design goals are understandability, query performance, and resilience to change. The predictable framework of a dimensional model can make the user interface more understandable, and processing more efficient. The symmetric structure of a dimensional model can accommodate unexpected new data elements without changing design decision (5). Every dimensional model is composed of one table with a multipart key, called the “fact” table, and a set of smaller tables called “dimension” tables. A fact table is the primary table in each dimensional model that is meant to contain measurements of the business. Each dimension table has a single-part primary key that corresponds exactly to one of the components of the multipart key in the fact table. A fact table is surrounded by a halo of dimension tables, where the dimension keys are primary keys in each respective dimension table. In this study, fact tables represent a many-to-many relationship with respect to several dimension tables and are only used for modeling the relationship between performance measures and input material and construction variables. In other words, most dimension tables contain many textural attributes (fields) that are the basis for constraining and grouping within data warehouse queries. End-users can always create a row header in a report by TRB 2004 Annual Meeting CD-ROM Paper revised from original submittal. Jae-ho Choi and Teresa M. Adams 5 dragging a dimension attribute from any of the dimension tables down into the spreadsheet, thereby making a row header. Data Warehouse Bus Architecture for PPAS On the basis of defining the important objectives of the PPAS, an overview of the pavement performance analysis system, called the data warehouse bus architecture, was deigned as shown in Table 2. The data warehouse bus architecture is a matrix that shows dimension tables in columns and fact tables (data marts) as rows. The matrix shows which dimension tables are used in which fact tables. Song et al. (7) explain that the data warehouse bus architecture was proposed to standardize the development of data marts into an enterprise-wide data warehouse, rather than to cause stovepipe data marts that were unable to be integrated into a whole. As shown in Table 2, the three data marts are combined into an integrated data warehouse through the conformed dimension tables. The dimensions used in multiple data marts are called conformed dimensions (5). Material testing, QC/QA, and aggregate consensus dimension tables are the examples of the conformed dimension tables. Conformed dimension tables make it possible that a single dimension table be used against multiple fact tables in the same database space (5). Fact Table Design In order to develop a fact table based on the data warehouse bus architecture for PPAS, the network overlay algorithm should be implemented on the attribute data on L&B database, PIFs and traffic database to be able to integrate the extracted datasets. This network overlay process divides the network into smaller sections where the attributes data are homogeneous and generates the new entity table (i.e., Layer Fact Table shown in figure 6 (a)), which is required for modeling the relationship between the layer fact table and dimension tables. The attributes (i.e., HWY, FROM_RP, and TO_RP) in layer fact table define one spatially divided pavement segment. The attributes data in denormalized dimension tables are related to the layer fact table through multipart keys (e.g., LAYER_ID, PROJ_ID, and SEQ). However, the network overlay process is not used to develop base and subbase fact tables because these two sub-layers are not directly related with the performance measures. Therefore, the SEQ attribute in PIFs is not used in base and subbase fact tables as shown in figure 6 (b) and (c). These two fact tables are only used to provide the sub-structure information of a pavement segment specified in layer fact table and can be obtained by dividing each base and subbase master files in L&B database into two parts: key groups (BASE_ID, PROJ_ID) and attributes (HWY, FROM_RP, and TO_RP). Those two sub-layers’ information can be provided to end-users through the conformed dimension tables, which enable each fact table to be related to other fact tables. A set of attributes, HWY, FROM_RP, and TO_RP in fact tables, are called spatial-enabling data (7). Using a Geographic Information System (GIS) tool could greatly extend the capabilities of this spatially capable data warehouse by allowing the users to view and analyze data using geo-analytical tools. This requires the addition of spatial data (e.g., map data) to the warehouse environment. However, these spatial data will not be fully integrated with the data warehouse, but rather will be managed by GIS (7). A tight coupling of a GIS and a data warehouse will permit managerial personnel to acquire access to relevant geographical information stored in the appropriate linked geographical data store. Complete Dimensional Models Figure 6 shows the complete dimensional models centered on layer fact table, base fact table, and subbase fact table. As was illustrated before, every fact table represents a many-to-many relationship and contains a set of two or more foreign keys that join to their respective dimension tables. Each dimension is defined by its primary key that serves as the basis for referential integrity with any given fact table to which it is joined. Most dimension table contains many textual attributes (fields) that are the basis for constraining and grouping within data warehouse queries. In the complete dimensional models, the material testing dimension can be combined with the QC/QA dimension and aggregate source dimension because they all have the same PROJ_ID attribute as a primary key. In this case, although this new attempted dimension would contain exactly the same information as the proposed one in figure 6, it is recommended to reject this design immediately because the new larger dimension would be unwieldy and would present performance problems and no user interface advantages (5). This also makes data TRB 2004 Annual Meeting CD-ROM Paper revised from original submittal. Jae-ho Choi and Teresa M. Adams 6 uploading easier because it does not require any pre-transformation or combining process prior to uploading to data warehouse. Adding QC/QA and Traffic Subdimensions to Layer Dimension If the user wants the specified section FROM_RP to TO_RP in Layer Fact table, the system only needs to provide the QC/QA data within that section. The TO_RP in QC/QA dimension of figure 7 is the location where a split sample is collected and a QC/QA data of that sample is recorded. The list of the QC/QA dimension whose TO_RP is less than the TO_RP in the Layer FACT table is the QC/QA attribute data of the specified pavement section end-user is interested in. The dimensional models require a slight change into snow-flaked model in the QC/QA dimension. The snow-flaked is a data model that is highly normalized with many small entities, taken to fourth or fifth normal form, such that, when laid out visually, the model looks like a snowflake (5). The reason that snow-flaked QC/QA subdimension table is urged in this study is because of the different level of granularity than the primary QC/QA dimension table. The possible approach to assign traffic data to the proposed PPAS is to build a subdimension that has the appearance of a snow-flaked table as shown in figure 8. The traffic subdimension is a set of traffic attributes that have been measured for the segments that a PIFs section belongs to. Several PIF sections may share this identical set of traffic attributes because one traffic dimension can include several PIFs sections. It makes sense to isolate this traffic data in a snow-flaked subdimension since the traffic data is available at a significantly different detail and is administered and loaded at different times than the rest of the data in the Section dimension. CONCLUDING REMARKS Current WisDOT’s pavement management information systems although are of high quality are deficient from an analytical viewpoint because they limit the analysis capability to determine the relationship between performance measures of a structure and the design and construction inputs. Therefore, the formulation and presentation of the strategy for relating the datasets extracted from source databases and files spatially using the network overlay method was presented to develop a conceptual database model (E-R model) required for implementing pavement performance analysis system. In addition, this study presented the dimensional model, which is an alternative to conventional E-R model, since the dimensional model has a number of important data warehouse advantages that the E-R model lacks. E-R model is mainly used to modeling the micro-scopic relationship between data and overly complex to the practitioners in the state and local government. On the other hand, a dimensional model contains the same information as an E-R model but packages the data in a symmetric format, thus making the user interface more understandable, and processing more efficient. It is believed that the proposed dimensional models can facilitate the proposed modeling framework by meeting the selected important objectives of the PPAS. To our knowledge, this is the first detailed dimensional data warehouse model specifically targeted for state agencies for improving pavement performance analysis. Next steps will be oriented at developing the software component which interconnects the two technologies: dimensional models for the PPAS data warehousing project and geographic information system. ACKNOWLEDGEMENTS The authors would like to thank Judith Ryan and David Friedrichs at Truax center at WisDOT for the information and comments for this study. REFERENCES 1. Nyerges T. L. Locational Referencing and Highway Segmentation in a Geographic Information System. ITE Journal, Vol. 27, 1990. 2. Quality Management Program Guide/Procedure Manual, Wisconsin Department of Transportation, Division of Transportation Infrastructure Development, Bureau of Highway Construction. 1999. TRB 2004 Annual Meeting CD-ROM Paper revised from original submittal. Jae-ho Choi and Teresa M. Adams 7 3. Schmit R. L. Development of Statistically-Based Methods to Determine QC/QA Testing Levels for Hot-Mix Asphalt Construction, PhD thesis. Div. Of Const. Engr. And Mange, University of Wisconsin, Madison, USA, 1998. 4. Choi Jae-ho. Framework of Developing Performance-related Specifications at WisDOT, PhD thesis. Div. Of Const. Engr. And Mange, University of Wisconsin, Madison, USA, 2002. 5. Kimball Ralph, Reeves Laura, Ross Margy, and Thornthwaite Warren. The Data Warehouse, Life Cycle Toolkit, expert methods for designing, developing, and deploying data warehouses. Wiley, Inc., New York, 1998. 6. Matthias et al. Fundamentals of Data Warehouses, Springer, New York, 2000. 7. Song Il-yeol, Kelly Levan-Shultz. Data Warehouse Design for E-Commerce Environments. Lecture notes in computer science. ISSUE 1727, Springer Verlag, New York, 1999. 8. Mennecke, B., Higgins, G. Spatial Data in the Data Warehouse: A Nomenclature for Design and Use. Proceedings of the Americas conference on information systems. Americas conference 5, 1999 Aug, Milwaukee, WI. TRB 2004 Annual Meeting CD-ROM Paper revised from original submittal. Jae-ho Choi and Teresa M. Adams 8 LIST OF TABLES AND FIGURES TABLE 1 Summary of Location Reference Schemes of Each Database TABLE 2 Data Warehouse Bus Architecture for PPAS FIGURE 1 Overview of the WisDOT’s pavement performance analysis system. FIGURE 2 Comparison of as-constructed QC/QA and field performance (Schmit, 1998). FIGURE 3 Schematic for relating components datasets. FIGURE 4 An Example of a network overlay between L&B database and PIFs. FIGURE 5 A conceptual database model (E-R model) for developing PPAS. FIGURE 6 Complete dimensional models for pavement performance analysis system. FIGURE 7 Snow-flaked tables in QC/QA dimension. FIGURE 8 Snow-flaked tables in traffic dimension. TRB 2004 Annual Meeting CD-ROM Paper revised from original submittal. Jae-ho Choi and Teresa M. Adams 9 Table 1 Summary of Location Reference Schemes of Each Database Database Location Reference Scheme L&B FROM_RP and TO_RP PIF FROM_RP, FRPM_PLUS, TO_RP, and TO_PLUS QC/QA Project ID, Quantity of material placed
منابع مشابه
Development of An Artificial Neural Network Model for Asphalt Pavement Deterioration Using LTPP Data
Deterioration models are important and essential part of any Pavement Management System (PMS). These models are used to predict future pavement situation based on existence condition, parameters causing deterioration and implications of various maintenance and rehabilitation policies on pavement. The majority of these models are based on roughness which is one of the most important indices in p...
متن کاملInvestigating the Performance of Cracked Asphalt Pavement Using Finite Elements Analysis
Occurrence of top down and bottom up fatigue cracking in asphaltic pavements is common. Conventional pavement analysis methods ignore the existence of cracks in asphaltic layers. However, it seems that the responses of cracked pavement would not be the same as a pavement without crack. This paper describes effects of crack type, position and length, and vehicles tire inflation pressure and axle...
متن کاملAsphalt Pavement Performance Model of Airport Using Microwave Remote Sensing Satellite
The purpose of this study is to build the binary logit model of an airport pavement that could monitor the pavement condition in near real time using microwave remote sensing satellite, then the relationship between the international roughness index (IRI) of an airport and backscattering values from PALSAR images of the ALOS satellite was determined. Total 390 data were used in analysis. This m...
متن کاملPrediction of The Pavement Condition For Urban Roadway A Tehran Case Study (RESEARCH NOTE)
This report is the result of a research project on a pavement management system that was preformed by the Transportation Division of Iran University of Science and Technology. Information used in the project was collected from 20 zones of the Tehran Municipality. Any maintenance and repair system for roads is normally compared of a number of general and coordinated activities in conjunction wit...
متن کاملAn Improved Modular Modeling for Analysis of Closed-Cycle Absorption Cooling Systems
A detailed modular modeling of an absorbent cooling system is presented in this paper. The model including the key components is described in terms of design parameters, inputs, control variables, and outputs. The model is used to simulate the operating conditions for estimating the behavior of individual components and system performance, and to conduct a sensitivity analysis based on the give...
متن کامل